The wraetlic NLP suite
نویسندگان
چکیده
In this paper, we describe the second release of a suite of language analysers, developed over the last five years, called wraetlic, which includes tools for several partial parsing tasks, both for English and Spanish. It has been successfully used in fields such as Information Extraction, thesaurus acquisition, Text Summarisation and Computer Assisted Assessment.
منابع مشابه
NAACL HLT 2009 Software Engineering , Testing , and Quality Assurance for Natural Language Processing ( SETQA - NLP 2009 )
We summarize our experiences building a comprehensive suite of tests for a statistical natural language processing toolkit, ClearTK. We describe some of the challenges we encountered, introduce a software project that emerged from these efforts, summarize our resulting test suite, and discuss some of the les-
متن کاملImplementing a Task Specific Grammar for Recognition and Parsing using the CPK NLP Suite for Spoken Language Understanding
This paper describes how a task specific grammar can be implemented using a dedicated “NLP” Augmented Phrase Structure (APS) grammar formalism. The APS is used for generation of appropriate semantic frames to be passed on to the dialogue manager of a spoken dialogue system. In a derived form, conforming to the HTK Standard Lattice format, the same APS may be used for constraining the approved s...
متن کاملEDISON: Feature Extraction for NLP, Simplified
When designing Natural Language Processing (NLP) applications that use Machine Learning (ML) techniques, feature extraction becomes a significant part of the development effort, whether developing a new application or attempting to reproduce results reported for existing NLP tasks. We present EDISON, a Java library of feature generation functions used in a suite of state-of-the-art NLP tools, b...
متن کاملMACAON An NLP Tool Suite for Processing Word Lattices
MACAON is a tool suite for standard NLP tasks developed for French. MACAON has been designed to process both human-produced text and highly ambiguous word-lattices produced by NLP tools. MACAON is made of several native modules for common tasks such as a tokenization, a part-of-speech tagging or syntactic parsing, all communicating with each other through XML files . In addition, exchange proto...
متن کاملA Test Suite for Inference Involving Adjectives
Recently, most of the research in NLP has concentrated on the creation of applications coping with textual entailment. However, there still exist very few resources for the evaluation of such applications. We argue that the reason for this resides not only in the novelty of the research field but also and mainly in the difficulty of defining the linguistic phenomena which are responsible for in...
متن کامل